[RFC] - Config is code #512

felipemello1 · 2025-10-30T19:53:00Z

This PR attempts to showcase the different styles we could use for our config system. A push here is if we can leave the .yaml and use .py instead, under the reasoning that "config is code".

Python can enable a) better type safety, b) import directly instead of using strings, and c) more flexibility for the user to define custom code within the config.

On the other hand, yaml can be easier to read and hydra provides some nice freebies.

Must have:

Lazy instantiation: Currently we don't have any way to instantiate a function through configs, e.g. 'loss: my_loss_fn'.
Handle complex patterns, e.g. nesting

Nice to have:

Composition: Our code is naturally fragmented (generator, rewards, etc). Touching one config without touching the others is a relevant feature.
Ability to code in the config, e.g. if statements, sum values, check conditions, etc.
CLI overrides

Proposed options:

OmegaConfg / Hydra
Fiddle (cfg option by google)
Factory + dataclasses
Plain python with partials
Plain python with dictionary
TODO: toml

I would like to hear what others think about the options, if we should stick with yaml and fully explore hydra/omegaconf or change to .py system.

This PR showcases different styles for our config system, exploring the "config is code" philosophy. Implements 5 approaches: 1. OmegaConf/Hydra (YAML-based) 2. Fiddle (Google's config library) 3. Factory + dataclasses 4. Plain Python with functools.partial 5. Plain Python with dictionaries Each approach demonstrates handling of: - Lazy instantiation - Nested component instantiation - Partial application for runtime args - Config composition and overrides

tianyu-l

Thanks for putting up these candidates! Left some initial impressions, might not be accurate.

tianyu-l · 2025-11-03T23:39:29Z

brainstorming/configs/config_partial.py

+    inner_partial = cfg2["model"].keywords["attn_config"]
+    inner_partial.keywords["num_heads"] = 64
+    model2 = cfg2["model"]()


definition part is OK, but its consumption is confusing when it comes to nested init

tianyu-l · 2025-11-03T23:40:39Z

brainstorming/configs/config_dicts.py

+        return base
+
+    cfg_variant = llama3_2_1b_large_lr()
+    attn_config_variant = cfg_variant["model"]["kwargs"]["attn_config"]["cls"](


seems too flexible, hard to read, and error-prone

agreed, its horrible. I put it here to make a contrast

tianyu-l · 2025-11-03T23:45:10Z

brainstorming/configs/config_dataclasses.py

+# LICENSE file in the root directory of this source tree.
+
+"""
+Dataclass config with inner Config classes.


can we get free cli override with tyro?
https://github.com/pytorch/torchtitan/blob/2ea6197b957936bdd4941e59a000cf31987a3184/torchtitan/config/manager.py#L56

that looks nice, didnt know about it

tianyu-l · 2025-11-03T23:58:10Z

brainstorming/configs/config_hydra.py

+    cfg2.optimizer.lr = 1e-4
+    optimizer_partial2 = instantiate(cfg2.optimizer)
+    optimizer2 = optimizer_partial2(params=model2.parameters())


This looks a bit confusing.

agreed, perhaps there is a better way? Not sure if i can pass the params already in instantiate.

The spirit here is that i can actually do once:
instantiate(cfg), and everything gets instantiated already at the very start, but when i call cfg.optimizer, i will get a partial.

In this example it feels weird because instead of doing `instantiate(config)```, i am instantiating each arg of the config individually

tianyu-l · 2025-11-04T00:03:28Z

brainstorming/configs/config_hydra.py

+- Lazy instantiation via hydra.utils.instantiate
+- Command-line override for free (--optimizer.lr=1e-4)
+
+Cons:


plus learning curve, compose, instantiate, etc.

in practice, i think that people would only use instantiate, and we can even get rid of this and instantiate the whole config once at the start. Partials can then be called where needed. e.g.

def main(path_cfg:str): cfg = instantiate(load_cfg(path_cfg)) model = cfg.model optimizer = cfg.optimizer(params=model.param)

tianyu-l · 2025-11-04T00:05:20Z

brainstorming/configs/config_dataclasses.py

+def llama3_2_1b_full():
+    output_dir = "/tmp/torchtune/llama3_2_1B/full"
+
+    return {


For this one and the fiddle one, I guess you can take this to extreme and make everything a (data)class, e.g. TrainerWithConfig class can have model, optimizer, data loader, etc. What's your thought?

My issue with this pattern of TrainerWithConfig.config() is that its too opinionated and impacts the entire codebase. 3rd party utilities, or local code that dont need to be a class, must be represented as such.

Between the two, i personally prefer fiddle.

But when i read config_fiddle.py and i read baseline.yaml, to my eyes, baseline.yaml is easier to parse and understand. Perhaps because its a simple config? I can imagine cases where using python can be handy.

On the composability side, if you look at baseline_different_bsz.yaml, it seems easier to abstract away experimentation from infra args.

TDLR

despite all of the hate, IMO .yaml + OmegaConf and/or Hydra is the lesser of all evils.

If we want to use .py, fiddle seems the easiest

dataclasses.config pattern are the safest, but impact the entire code base and is harder to read

felipemello1 requested a review from allenwang28 October 30, 2025 19:53

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 30, 2025

felipemello1 force-pushed the config_is_code branch from ec6a892 to 55c308d Compare October 30, 2025 19:57

tianyu-l reviewed Nov 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC] - Config is code #512

[RFC] - Config is code #512

felipemello1 commented Oct 30, 2025 •

edited

Loading

Uh oh!

tianyu-l left a comment

Uh oh!

tianyu-l Nov 3, 2025

Uh oh!

tianyu-l Nov 3, 2025

Uh oh!

felipemello1 Nov 4, 2025

Uh oh!

tianyu-l Nov 3, 2025

Uh oh!

felipemello1 Nov 4, 2025 •

edited

Loading

Uh oh!

tianyu-l Nov 3, 2025

Uh oh!

felipemello1 Nov 4, 2025 •

edited

Loading

Uh oh!

tianyu-l Nov 4, 2025

Uh oh!

felipemello1 Nov 4, 2025 •

edited

Loading

Uh oh!

tianyu-l Nov 4, 2025

Uh oh!

felipemello1 Nov 4, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[RFC] - Config is code #512

Are you sure you want to change the base?

[RFC] - Config is code #512

Conversation

felipemello1 commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tianyu-l left a comment

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

felipemello1 Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

felipemello1 Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

felipemello1 Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

felipemello1 Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tianyu-l Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

felipemello1 Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

felipemello1 commented Oct 30, 2025 •

edited

Loading

felipemello1 Nov 4, 2025 •

edited

Loading

felipemello1 Nov 4, 2025 •

edited

Loading

felipemello1 Nov 4, 2025 •

edited

Loading

felipemello1 Nov 4, 2025 •

edited

Loading